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PROBLEM 


Obtain  design  procedures  and  criteria  for  reliable  Navy  communi¬ 
cation  in  a  hostile,  adverse,  or  unstable  environment,  through  use  of 
statistical  decision  methods  and  other  signal-processing  techniques. 
Specifically,  investigate  means  for  maintaining  a  constant  false-alarm  rate  in 
spite  of  changes  in  the  environment. 


RESULTS 

1.  Distribution-free  methods  of  signal  detection  can  provide  a  con¬ 
stant  false-alarm  rate  for  communications  systems. 

2.  The  specific  distribution-free  methods  described  here  are  easily 
implemented  and  analyzed. 

3.  Distribution-free  methods  require  only  slightly  higher  signal 
levels  (approximately  2-dB  SNR)  than  optimum  procedures  for  which  it  is 
assumed  that  the  probability  distribution  of  the  data  is  known. 


RECOMMENDATIONS 

1 . ;  Investigate  plans  for  future  communication  systems  for  possible 
application  of  distribution-free  procedures  in  signal  detection. 

2.  Investigate  the  possibility  of  implementing  distribution-free 
procedures  through  LSI  or  micro-circuit  techniques. 
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INTRODUCTION 

THE  ADVANTAGES  OF  DF  DETECTION 

Distribution-free  (DF)  methods  of  signal  detection  are  indicated 
whenever  the  distributions  of  the  signal  data  and  the  no-signal  data  are  unknown 
(ref.;  1-4] ..  Even  if  the  distribution  of  the  data  is  known  at  some  time, 
uncontrollable  phenomena  may  cause  changes  such  that  at  a  later  time  the 
distribution  will  be  vastly  different.  These  clu  nges  will  cause  a  procedure 
based  on  the  ‘known’  distribution  to  have  unknown  (possibly  unfavorable) 
characteristics,  whereas  most  DF  procedures  will  have  several  important 
constant  characteristics  in  spite  of  changes  in  the  distribution  of  the  underlying 
data.  Specifically,  DF  procedures  can  be  obtained  such  that  the  false-alarm 
rate  is  a  constant,  provided  only  that  the  distribution  of  the  underlying  data 
belongs  to  some  generally  large  class  of  distributions. 

A  detection  procedure  is  defined  as  distribution-free  over  the  class  5F  of 
distributions  if  the  probability  distribution  of  the  statistic  on  which  detection 
is  based  is  the  same  whenever  the  distribution  F  of  tjie  no-signal  data  belongs 
to  the  class  J.  As  will  be  seen  later,  the  test  statistics  described  here  are 
obtained  from  comparisons  of  data  from  two  sources,  one  of  which  may 
contain  a  signal.  Therefore,  when  a  signal  is  present  in  one  of  the  sources,  the 
distribution  of  the  test  statistic  will  depend  on  the  distributions  of  the  signal 
data  (from  the  source  containing  signal)  and  the  no-signal  data  (from  the 
other  source).  The  collection  of  data  which  might  contain  a  signal  is  called 
the  ‘possible-signal  sample’  (PSS),  and  the  collection  of  data  which  is  used 
for  comparison  purposes  is  called  the  ‘known-noise  sample’  (KNS). 

SCOPE  OF  REPORT 

Two  DF  procedures  are  described  in  this  report,  and  both  are  based 
on  the  rank-order  statistics  of  the  observed  data  (ref.  21 These  procedures 
are  compared  with  optimum  procedures  that  are  based  on  the  assumption 
that  the  probability  distributions  of  the  data  are  known.  Implementation  of 
the  procedures  is  discussed  and  particular  methods  ot  data  storage  and 
processing  are  described.  Also,  methods  for  obtaining  the  KNS  are  described. 


THE  DETECTION  PROBLEM 

The  detection  problem  considered  here  is  that  of  detecting  the 
presence  of  a  signal  at  the  output  of  a  communication  receiver.  This  includes 
receivers  that  are  used  for  the  purpose  of  intercepting  signals  solely  for  the 
knowledge  of  their  existence  (i.e.,  not  for  their  message  content),  and  receivers 
that  are  used  for  detecting  signals  from  which  message  content  is  to  be 
extracted.  Only  the  detection  aspect  is  considered;  extraction  of  information 
is  accomplished  by  further  processing.  This  detection  problem  occurs,  for 
example,  in  a  communication  system  that  precedes  each  message  with  a 
preamble  consisting  of  repeated  transmissions  of  a  narrowband  signal.  The 
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purpose  of  the  detection  procedure  is  to  determine  when  (or  whether)  the 
preamble  is  received  so  that  the  data-processing  or  recognition  stages  can  be 
applied  to  the  message  that  follows. 

Distribution-free  detection  procedures  are  applicable  to  these  problems 
since  a  constant  false- alarm  rate  is  maintained  even  if  the  probability  distri¬ 
bution  of  the  observed  data  changes  when  no  signal  is  present.  Procedures 
that  require  knowledge  of  the  probability  distribution  of  the  data  may 
produce  numerous  false  alarms,  especially  during  times  of  heavy  jamming  or 
interference.  The  false  alarms  may  cause  a  tie-up  of  equipment,  with  the 
result  that  the  actual  message  is  not  received.  For  example,  suppose  the 
system  utilizes  a  moving-window  detector  (MWD)  in  conjunction  with  a  time- 
lag  recorder  (TLR),  as  indicated  in  figure  1  [ref.  5] .  If  the  MWD  decision  is 
that  a  signal  is  present,  an  alarm  causes  the  data  from  the  TLR  to  be  read  out 
and  processed.  If  a  false  alarm  occurs,  the  TLR  is  tied-up  for  the  time 
necessary  to  read  out  its  contents,  and  any  message  which  occurs  during  this 
time  is  not  recorded. 


Figure  1.  A  system  utilizing  the  moving-window  detector. 


The  detection  problem  can  be  stated  in  statistical  terminology  as 
follows.  Let  x.j,j=l,  2,  . N,  be  independent  observations  of  the  output  X 

from  a  receiver,  considered  as  a  random  variable  with  distribution  function 
H(x).  Suppose  that  when  a  signal  is  present  at  the  receiver  output,  H(x)  =  G(x), 
and  when  no  signal  is  present,  H(x)  =  F(x),  where  both  G(x)  and  F(x)  are 
unknown.  (Several  assumptions  about  F(x)  and  G(x)  are  made  later.)  ?n  the 
basis  of  the  observations  xj,  j=l,  2,  .,  N,  we  are  to  choose  between  the 

hypothesis  Hg  that  H(x)  =  F(x)  and  the  hypothesis  Hj  that  H(x)  =  G(x).  We 
also  assume  that  a  ‘known-noise  sample’  (KNS)  y  j ,  y^j  is  available  such 
that  the  distribution  of  the  random  variable  Y  (of  which  each  yj  is  an 
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observation)  is  F(x).  That  is,  we  have  M  independent  observations  of  a 
random  variable  Y  that  has  the  same  distribution  as  X  does  when  no  signal  is 
present.  This  situation  is  depicted  in  figure  2,  in  which  a  simplified  block 
diagram  of  a  distribution-free  detection  system  is  shown.  Methods  for 
obtaining  the  KNS  are  discussed  later. 


Figure  2.  A  distribution-free  detection  system. 


For  the  case  of  an  MWD,  the  hypotheses  just  stated  apply  only  when 
the  window  is  filled  with  signal  data  (H  j )  or  when  the  window  is  filled  with 
no-signal  data  (Hq).  The  intermediate  situation  in  which  signal  data  are  just 
entering  or  leaving  the  window  is  a  mixture  of  Hq  and  H  j ,  and  leads  to 
complicated  computations  of  detection  probabilities  [ref.  5] .  For  the 
remainder  of  this  report  we  neglect  the  effect  of  the  MWD;  that  is,  we 
consider  only  the  situation  in  which  all  N  observations  are  either  possible- 
signal  data  or  known-noise  data.  The  probability  of  detection  that  is  computed 
is  a  lower  bound  to  the  true  probability  of  detection  for  an  MWD,  since 
several  opportunities  for  detection  are  neglected  (those  in  which  the  window 
contains  a  mixture  of  signal  data  and  no-signal  data).  Also,  if  the  results  given 
here  are  applied  to  an  MWD,  the  false-alarm  rate  is  N  times  larger  than  that 
stated,  since  the  MWD’s  decision  rate  is  N  times  larger  than  that  of  a  detector 
for  signals  of  known  arrival  times. 
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ASSUMPTION?  AND  EXAMPLES 


We  have  previously  assumed  that  tu’  random  variable  X  (the  PSS)  and 
the  random  variable  Y  (the  KNS)  are  such  that  X  and  Y  have  the  same  distri¬ 
bution  when  no  signal  is  present.  We  restate  this  assumption  and  make 
additional  assumptions: 

1.  The  random  variables  X  and  Y  have  continuous  distribution  func¬ 
tion  3(x)  and  F(y),  respectively. 

2.  Observations  on  X  and  Y  are  independent;  that  is,  (x  j ,  . .,  xn; 
y  1 ,  . .  .,  ym)  are  independent  for  any  ~i  and  m. 

3.  When  no  signal  is  present,  F(x)  =  G(x). 

4.  When  a  signal  is  present,  X  is  stochastically  larger  than  Y.  That  is, 
the  distribution  function  G(x)  of  X  is  such  that  G(x)  <  F(x)  for  all 
x,  with  strict  inequality  for  some  x. 

Assumption  1  guarantees  that  ties  (two  different  observations  equal) 
occur  with  probability  zero,  and  is  made  for  convenience  in  analyzing  the 
tests.  In  an  actual  application,  data  will  most  likely  be  processed  by  digital 
equipment,  and  the  occurrence  of  ties  is  a  possibility  that  must  be  acounted 
for.  The  assumption  of  independence  is  made  for  analytical  convenience 
(that  is,  so  that  the  tests  can  be  analyzed),  and  the  effect  of  having  this 
assumption  violated  is  usually  not  clear.  However,  in  most  detection  appli¬ 
cations  independence  can  reasonably  be  assumed,  especially  if  the  receiver 
input  is  passed  through  a  narrowband  filter  of  bandwidth  B  and  the  receiver 
output  is  sampled  at  instants  separated  by  intervals  larger  than  1/B. 

Assumption  3  requires  that  the  KNS  have  the  same  statistical  proper¬ 
ties  the  PSS  does  when  no  signal  is  present.  Later  we  describe  some  reasonable 
methods  for  obtaining  the  KNS.  In  general,  the  method  for  obtaining  the 
KNS  depends  on  the  type  of  signal  which  is  to  be  detected. 

Assumption  4  states,  roughly,  that  signal-plus-noise  observations  tend 
to  be  larger  than  observations  of  noise  only.  This  assumption  is  reasonable 
when  the  data  are  obtained  from  an  envelope  detector  (square-law  or  linear), 
and  in  many  other  cases,  including  the  case  of  an  additive  signal  in  noise  (not 
necessarily  Gaussian).  Examples  of  distributions  satisfying  assumption  4 
which  are  considered  here  are: 

A.  Both  F(x)  and  G(x)  are  Rayleigh;  that  is,  F(x)  =  1  -  exp[-x2/2 j 
andG(x)=  1  -exp[-x2/2(l+S)].  When  S  =  0,  F(x)  =  G(x). 

B.  F(x)  is  Rayleigh  and  G(x)  is  Rice;  that  is,  F(x)  =  1  -  exp[-x2/2] 
and  g(x)  =  dG(x)/dx  =  xexp[(-x2  +  a/)/2]  Iq  (ax),  where  Ig( )  is  the  modified 
Bessel  function  of  the  first  kind,  order  zero.  When  a  =  0,  F(x)  =  G(x). 

C.  G(x)  =  1  -  [  1  -  F(x)] u,  0  <  u  <  1 ,  where  F(x)  is  any  absolutely 
continuous  distribution  function.  When  u  =  1,  F(x)  =  G(x). 

D.  F(x)  is  normal  with  zero  mean  and  unit  variance,  and  G(x)  is 
normal  with  mean  n  >  0  and  unit  variance..  When  n  =  0,  F(x)  =  G(x). 
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Note  that  example  A  is  a  special  case  of  example  C,  with  u  =  1  /( 1+S). 
Therefore,  results  are  obtained  for  the  general  case,  example  C.  Also  note 
that  if  a  random  variable  U  has  one  of  the  Rayleigh  distributions  of  example 
A,  the  transformed  random  variable  W=U^/2  has  one  of  the  distributions 
F(w)  =  1  -  exp[-w]  and  G{w)  =  1  -  expf-w/(l+S>] ,  This  is  again  a  special 
case  of  example  C,  with  u  =  l/(!+S).  Thus,  results  obtained  for  example  C 
apply  to  both  linear  and  square-law  detection  of  a  (Swerling  case  2) 
fluctuating  signal  in  white  Gaussian  noise.  Examples  B  through  D  include 
three  of  the  situations  most  frequently  encountered  in  signal  detection 
problems:  envelope  detection  of  a  narrowband  signal  in  white  Gaussian 
noise,  envelope  detection  of  a  fluctuating  narrowband  signal  in  white  Gaussian 
noise,  and  coherent  signal  detection. 

The  presentation  of  these  examples  does  not  imply  that  the  DF 
procedures  described  here  are  applicable  only  to  them.  In  fact,  if  it  is  known 
that  any  of  the  examples  describes  the  true  situation,  then  the  optimum 
Neyman-Pearson  test,  based  on  the  known  distributions,  should  be  used.  As 
will  be  seen  later,  for  the  DF  procedures  described  here,  calculation  of  the 
probability  of  detection  requires  that  both  F(x)  and  G(x)  be  known  (i.e., 
assumed  known).  Such  calculations  will  be  made  only  for  the  examples 
given,  but  should  be  indicative  of  test  performance  for  other  distributions. 


THE  DETECTION  PROCEDURES 


The  detection  procedures  described  here  are  based  on  ranks.  Obser¬ 
vations  x  j ,  . . .,  Xfl  of  the  random  variable  X  (the  PSS)  are  made,  and  for 
each  xj,  k-1  observations  of  the  random  varible  Y  (the  KNS)  are  also  made. 
The  rank  Rj  of  Xj  with  respect  to  j  Xj;  y  j  j,  . . .,  y^.j  j  j  is  determined*  and 
the  statistic 

N 

W  =  £  BfRj)  (1) 

i=l 

is  compared  with  a  threshold  T,  (The  function  B  is  discussed  below.)  If 
W  >  T,  a  signal  is  claimed  present.  The  threshold  T  is  chosen  to  provide  the 
desired  false-alarm  probability.; 

The  two  procedures  that  are  discussed  here  are  obtained  from  two 
specific  choices  of  the  function  B  in  ( 1 ).  These  choices  are 


Bi(Ri)  =  Ri 


(2a) 


B2(Rj)  = 


0  if  Rj  <  k-r 
.1  if  Rj  >  k-r 


(2b) 


♦The  rank  of  Xj  is  defined  to  be  one  plus  the  number  of  y^’s  that  are  less  than  x;. 
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where  r  is  an  integer  between  0  and  k.  The  procedure  based  on  (2a)  is  called 
the  rank-sum  (RS)  procedure  (ref.  2] .  A  decision  is  made  that  a  signal  is 
present  if  the  sum 


N 

i=l 


(3) 


of  the  ranks  of  the  X  observations  exceeds  some  threshold  Tj.  The  method 
for  choosing  T  j  is  described  later. 

The  procedure  based  on  (2b)  is  called  the  rank-quantization  (RQ) 
procedure  (ref.  2] .  A  decision  is  made  that  a  signal  is  present :r  the  sum 

N 

W2  =  JB2<Ri>  (4> 

i=l 


exceeds  some  threshold  T2  which  is  determined  by  the  desired  false-alarm 
probability.  The  false-alarm  probability  also  depends  on  the  parameter  r  in 
(2b),  and  selection  of  r  is  discussed  later. 

We  note  here  that  when  Hq  is  true  -  that  is,  when  no  signal  is  present  - 
the  probability  distribution  of  Rj  is  independent  of  the  probability  distribution 
F(x)  of  the  data,  provided  only  that  F(x)  is  continuous.  This  is  verified  later, 
when  the  probability  distribution  of  R  is  derived.  Thus,  the  probability 
distributions  of  Wj  and  W2  are  also  independent  of  F(x),  and  both  procedures 
are  distribution-free  over  the  class  of  all  continuous  distributions. 

There  are  many  DF  procedures  that  could  be  applied  in  situations 
satisfying  assumptions  1-4  (ref.  1 1 .  However,  the  procedures  described  here 
have  the  attribute  of  ease  of  implementation,  and  this  is  not  true  of  many  DF 
procedures.  Also,  the  RQ  and  RS  procedures  are  easily  analyzed,  so  that 
evaluation  of  the  test  performance  is  not  based  on  asymptotic  results  or  on 
Monte  Carlo  simulation.  Thus,  while  other  procedures  may  have  better 
(theoretical)  performance,  it  may  be  impossible  or  impractical  to  analyze 
them  or  to  implement  them  for  a  real-time  operation. 


ANALYSIS 


Suppose  the  random  variable  X  has  a  continuous  distribution  function 
G(x)  and  the  random  variables  Y  j ,  . . .,  Y^j  are  identically  distributed 

with  continuous  distribution  function  T’(x).  Suppose  further  that  X,  Y  j , 

., .,  Y^.j  are  independent.:  Then,  the  rank  R  of  X  with  respect  to  jx,  Yj , 

, .. .,  Yjc_j  J  has  a  probability  mass  function  (pmf)  given  by 


P(R=j] 


(F(x)]J_I  ( l-F(x)]  k_j  dG(x) 


(5) 


j=l,2. 


k. 


m 


This  follows  from  the  number  of  combinations  for  which  R  -  j  and  the 
density  associated  with  each  combination  (ref.  6,  p.  43} .  Note  that  when 
F(x)  =  G(x)  (i.e.,  when  Hn  is  true),  the  substitution  u  =  F(x)  reduces  (5)  to 

P(R=jJ  =  l/k,j=l,2,  ...,k  (6) 

for  any  continuous  distribution  function  F(x).  Thus,  the  pint's  of  the 
statistics  Wj  and  W2  are  independent  of  F(x)  when  Hg  is  true. 

FALSE-ALARM  PROBABILITY 

To  determine  the  threshold  Tj ,  we  must  obtain  the  pmf  of  Wj  when  Hg 
is  true.  From  the  discrete  uniform  distribution  given  by  (6),  the  pmf  of  Wj 
is  obtained  by  an  N-fold  convolution.  The  threshold  Tj  is  then  determined 
such  that 


kN 

P0[W,>Ti]  =  J  P0[W,  =j!  =a  (7) 

j=Tl 


where  a  is  the  desired  false-alarm  probability,  (The  notations  Pg[  J  and  r  j  ( ] 
are  used  to  denote  the  probability  of  the  parenthetical  event  when  Hg  and  Hj 
are  true,  respectively.)  Since  Wj  is  discrete,  randomization  may  be  required 
to  obtain  specific  values  of  a  exactly,* 

For  the  RQ  case,  the  test  statistic  W2  has  the  binomial  distribution 


P(W2=j] 


(8) 


j  =  0,  1,  . . .,  n 


where  p  =  P(B2(R|)  =  1  ] .  From  the  definition  of  B2(Rj), 

k-r 

P[B2(Ri)=l]  =  l- ^P[Ri=jl  (9) 

j=l 

where  P[Rj  =  j]  is  given  by  (5).  Thus,  when  Hg  is  true,  Pg[Rj  =  j]  =  1/k  from 
(6),  and 

P0[32(Ri)=l]  =  r/k  (10) 


i 


•i 

4 


‘Randomization  is  accomplished  by  finding  the  threshold  T  such  that  Pq(W  >T  +  1  )<o 
and  Pq(W  >  T)  > a.  Then,  if  W>  T,  a  signal  is  claimed  present.  If  W  =  T,  a  random 
experiment  is  performed  with  two  possible  outcomes  A  and  Ar,  having  probabilities  P(A) 
and  1-P(A),  and  if  the  outcome  is  A,  a  signal  is  claimed  present.  The  probability  P(A)  is 
chosen  so  that  Pq(W  >  T  +  1 )  +  P(A)Pq(W  =  T)  =  a..  The  random  experiment  could  be  the 
result  of  generating  a  random  number. 


1 


From  (10)  and  (8).  the  fake-alarm  probability  is  determined  to  be 
N 

-“■"I 

j=T2 

Again,  randomization  may  be  required  for  some  values  of  a.  The  value  of  N 
is  usually  determined  from  practical  considerations  and  therefore  is  not 
considered  a  variable  in  (1 1).  Likewise,  the  parameter  k  is  not  considered  a 
variable.  Therefore,  the  fake-alarm  probability  is  a  function  of  r  and  T2,  for 
fixed  N  and  k.  This  fact  is  discussed  in  OPTIMIZATION  OF  THE  RQ 
PROCEDURE. 

DETECTION  PROBABILITY 

When  a  signal  is  present,  the  pmf  of  the  rank  Rj  of  Xj  is  given  by  (S). 
Thus,  to  find  the  probability  of  detection  for  the  RS  case,  we  need  only  to 
perform  an  N-fold  convolution  of  the  discrete  distribution  of  Rj  and  compute 

kN 

pd=yp,iw,=ii  02) 

j^i 

Calculation  of  the  probability  of  detection  requires  specification  of  both  F(x) 
and  G(x)  or  of  some  deterministic  functional  relation  between  them,  such  as 
is  given  in  example  C 

To  find  the  probability  of  detection  for  the  RQ  case,  we  use  (9)  and 
(5)  to  obtain 


“ 


jfle-r)^ 


(11) 


p1=P1[B2(Ri)=l]  (13) 

The  probability  of  detection  is  then  given  by 


which  depends  on  both  T2  and  r. 

For  examples  B  and  C,  equation  (5)  can  be  solved  in  closed  form,  and 
for  example  D,  a  solution  is  obtained  by  numerical  integration.  Tne  solution 
to  equation  (5)  for  the  Rice  case  (example  B)  is  given  by 


/k-i\ 

Pi(Ri=il  \H) 


j  [  i-e~x2/2] i_l  [e~x2/2ik“j 


.xeV  +  a2)/2io(ax)dx 


(15a) 
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(!5b) 


J k-;\  r  ..  /j  -l\  eipla^O-tn-Ufflro+k-i  *1)1 

v-'/ib  v"/  ”+k'i  +  l 

which  is  obtained  by  using  pair  22,  section  1 2.2,  page  74  of  Roberts  and 
Kaufman  l ref.  7], 

For  the  case  of  example  C,  the  solution  to  equation  (S)  is  [ref.  8] 

P,[Rj  =  k[=|u(k-l)!r(k-j+ti)|/|(k-i)!r(k+u)j,  j=l,  2,  .  ,k.  (16) 

Using (10)  and  the  relation  Hx+l)  =  xr(x),  we  can  obtain  Pj[Rj=jl 
;ecarsiv;ly  as 


P,[Ri=  11  =  u/(k  -1  +u) 

Pi  [Rj = J  +  1 1  =  Pj[Ri  =  JKH)/(k  -  j  -  1  +  u),  j  =  1, 2,  , , .,  k-I. 


OPTIMIZATION  OF  THE  RQ  PROCEDURE 

Optimization  of  the  RQ  procedure  is  defined  as  the  selection  of  the 
pair  0*2,  r)  of  thresholds  which  maximize  the  detection  probability  while 
the  faise-aiarm  probability  a  is  held  fixed  [ref.  9  j .  (Rea  l!  that  to  achieve  a 
specific  value  of  a  exactly,  randomization  may  be  required.)  In  the  maximi¬ 
zation  of  pj  it  is  assumed  that  N  and  k  are  constant;  results  given  later  show 
the  effect  of  increasing  N  and  k. 

The  probability  of  detection  pj  depends  on  the  solution  to  (5).  and 
(5)  depends  on  the  probability  distributions  of  both  the  signal  and  no-signal 
data.  Optimization  thus  requires  some  assumption  about  the  distributions 
associated  with  the  data;  however,  these  assumptions  in  no  way  affect  the  fact 
that  the  procedure  is  distribution-free.  The  false-alarm  probability  is  held 
fixed  independently  of  the  probability  distribution  of  the  no-signal  data. 

Optimization  is  accomplished  as  follows.  For  each  r.  r  =  1 ,  2,  , . k, 
equation  (1 1)  is  used  to  determine  the  threshold  T2  =  T2(r)  that  provides  the 
false-alarm  probability  closest  to  the  desired  one;  randomization  is  then  used 
to  achieve  or  exactly..  Also,  for  each  r,  p  j  (r)  is  obtained  from  ( 1 3)  by  using  (9) 
and  (5)  for  the  particular  distributions  F  and  G  that  are  assumed.  The 
resulting  pj(r)  is  used  in  (14),  together  with  the  threshold  T2(r)  determined 
above,  to  obtain  the  probability  of  detection  p^r).  The  value  of  r  and  the 
corresponding  threshold  T2(r)  for  which  p^Cr)  is  maximum  arc  chosen. 

Results  given  later  indicate  the  variation  of  p^  with  r  for  various  choices  of 
F  and  G,  and  show  the  importance  of  applying  the  optimization  procedure. 

The  optimization  procedure  just  described  requires  that  distributions 
F  and  G  be  assumed.  However,  one  reason  for  using  distribution-free  proce¬ 
dures  is  to  avoid  such  assumptions.  In  fact,  if  the  distributions  F  and  G  of 
the  no-signal  and  signal  data  are  known,  then  distribution-free  procedures  are 
inappropriate,  since  the  optimum  procedure  based  on  the  known  distributions 
can  be  used.;  Thus,  in  a  practical  application  it  is  necessary  to  repeat  the  above 


optimization  procedure  for  several  pain  of  distributions  Hedy  to  be 
encountered,  and  select  representative  values  for  r  and  Tjfr). 

As  mentioned  previously,  the  value  of  N  is  usually  determined  from 
practical  considerations;  however,  results  given  later  show  that  increasing  N 
increases  the  probability  of  detection.  Thus,  N  should  be  made  as  large  as 
practical.  Similarly,  the  size  k-1  of  the  KNS  should  be  made  as  large  as 
possible,  provided  that  the  optimization  procedure  just  described  is  used. 
Abo,  care  must  be  taken  to  ensure  that  increasing  the  size  of  the  KNS  does 
not  invalidate  any  of  the  four  assumptions  made  previously.  For  example, 
increasing  the  size  of  the  KNS  by  sampling  more  rapidly  may  invalidate  the 
assum prior  of  independence. 


OBTAINING  THE  KNOWN-NOiSE  SAMPLE 

The  KNS  can  be  obtained  by  sampling  in  the  frequency  domain,  in 
the  time  domain,  or  in  combinations  of  both.  The  objective  to  be  met  in 
choosing  a  method  for  obtaining  the  KNS  is  to  ensure  that  assumptions  1-4 
are  satisfied.  However,  it  is  usually  true  that  assumptions  1  and  4  are 
satisfied,  especially  in  cases  in  which  the  receiver  output  is  from  an  envelope 
detector.  Thus,  ensuring  that  the  observations  are  independent  and  identically 
distributed  random  variables  (assumptions  2  and  3),  in  the  absence  of  signals, 
is  most  important 

We  describe  two  methods  for  obtaining  the  KNS.  In  one  method 
sampling  is  in  the  frequency  domain  [ref.  10,  p.  69] ,  and  in  the  other  in  the 
time  domain  [ref.  10,  p.  118).  The  two  methods  can  obviously  be  combined 
to  include  sampling  in  both  time  and  frequency.  These  methods  are  intended 
only  to  indicate  how  the  KNS  can  be  obtained  in  practice.  More  elaborate 
methods  are  likely  to  be  required  in  many  systems. 

FREQUENCY  DOMAIN 

Suppose  the  signals  that  are  to  be  detected  are  replicas  S.(t),  i  =  1 , 

. . N,  of  the  narrowband  signal 

S(t)  =  Acos2xfQt,  0  <  t  <  T, 


=  0,  otherwise 

The  receiver  may  be  realized  as  a  narrowband  bandpass  filter  of  bandwidth 
B  *5  1/T,  centered  at  fg,  followed  by  a  square-law  rectifier  [ref.  1 1,  p.  63-64] . 
In  actual  application,  the  bandpass  filtering  is  usually  accomplished  in  the 
intermediate-frequency  (i-f)  amplifier  shown  in  figure  3,  in  which  a  super¬ 
heterodyne  receiver  is  diagrammed.  The  square-law  rectified  output  from 
this  filter,  sampled  at  equally  spaced  intervals  of  T  seconds,  is  the  PSS  xj, 

•  -  •’  XN- 

The  KNS  can  be  obtained  by  using  k-1  parallel,  nonoverlapping, 
narrowband  bandpass  filters,  each  of  bandwidth  B  and  centered  at  fj,  j  =  1 , 

.,  k-1 ,  as  shown  in  figure  4.  Tire  square-law  rectified  outputs,  sampled  at 
equally  spaced  intervals  of  T  seconds,  are  the  KNS  yjj,  j  =  1 ,  , k  -1 , .»  =  1 , 


V**’ 


? 


. . N.  The  filters  in  figure  4  can  be  realized  as  the  i-f  amplifien  of  k-1 
parallel  superheterodyne  receivers  that  have  a  common  r-f  amplifier,  each 
with  a  different  local  oscillator  frequency.  Thus,  with  the  exception  of  the 
local  oscillators,  the  circuitry  for  aD  k  receivers  (including  the  one  for 
obtaining  the  PSS)  would  be  identical. 

The  locations  of  the  center  frequencies  and  passbands  are  indicated  in 
the  figure  below  for  k  =  5. 


-jzzzzi _ Emm _ emu  r-m  r~m 

fl  f2  f0  f.  f4 

I  ■  B  —•■I 

If  the  duration  T  of  the  signal  Sf  t)  is  long,  the  bandwidth  B  is  small.  Thus, 
for  large  T  the  frequency  range  By  =  should  be  sufficiently  small 

that  the  assumption  of  identically  distributed  random  variables  (assumption 
3)  is  satisfied.  Similarly,  since  the  filter  passbands  are  nonoverlapping  and 
the  outputs  are  sampled  at  intervals  T  =  1/B,  the  assumption  of  independence 
(assumption  2)  should  be  satisfied. 

TIME  DOMAIN 

Suppose  that,  when  sent,  the  narrowband  signals  S|(t),  i=l, . . .,  N, 
of  duration  T  seconds,  are  transmitted  at  equally  spaced  intervals  of  kT 
seconds  as  shown  below.  Then  the  KNS  can  be  obtained  by  sampling  in 
the  time  domain. 

n  n  n 
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If  the  receiver  output  (for  example,  the  square-law  rectifier  output  in  fig.  3) 
is  sampled  at  equally  spaced  intervals  of  T  seconds,  then  every  sample 
will  be  possible-signal  data,  while  the  intermediate  k-1  observations  will  be 
known-noise  data.  This  presupposes  that  it  is  known  when  to  sample  to 
obtain  the  PSS.  If  this  knowledge  is  not  available,  a  moving-window  proce¬ 
dure  can  be  applied. 

A  method  for  simultaneously  obtaining  the  KNS  and  the  rank  Rj  of 
Xj  is  shown  in  figure  5.  The  square-law  rectified  output  from  the  i-f  amplifier 
is  fed  into  a  tapped  delay  line,  with  the  delay  between  taps  approximately 
T  =  1/B.  The  output  X(t)  from  the  first  delay-line  tap  is  compared  in 
comparator  Cj  with  Y j  ft),  in  comparator  C2  with  Y2(t),  etc.,  and  in  com¬ 
parator  C^i  with  Y^.j  (t).  The  output  Wj  (t)  from  comparator  Cj  is  1  if 
X  (t)  >  Yj  (t)  and  0  otherwise,  as  indicated  in  *he  inset  to  figure  5.  The 
outputs  Wj  (t)  are  summed  to  obtain  R(t).  and  R(t)  is  sampled  at  times  tj, 

i  =  1 . N,  Since  R(t)  counts  the  number  of  Yj  (t)  that  are  less  than  X(t), 

the  rank  Rj  of  Xj  =  X(tj)  is  R(tj)  +  1 . 
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RESULTS 


Results  are  presented  that  compare  the  DF  procedures  with  optimum 
procedures  based  on  the  assumption  that  the  probability  distributions  of  the 
data  are  known  (fig.  6-9).  For  the  purpose  of  presenting  these  results,  we 
define  a  signal-to-noise  ratic  (SNR)  for  each  of  the  examples  B  through  D. 
For  examples  B  and  D,  the  SNR  is  defined  to  be  a2/2  and  p2,  respectively, 
and  for  example  C,  the  SNR  is  defined  to  be  (l-u)/u.  For  examples  B  and  D 
the  definition  of  SNR  is 


sNR=E<xVjgg) 

E(Y2) 


(18) 


where  the  random  variables  X  and  Y  have  distribution  functions  G  and  F. 
respectively  [ref..  1 2,  15-3] ,  The  definition  of  SNR  for  example  C  results 
from  applying  ( 1 8)  to  example  A  and  using  the  relationship  u  =  1/(1  +S). 

Figure  6  shows  the  probability  of  detection  p^  (r)  as  a  function  of  the 
quantization  level  r  defined  in  equation  (2b).  These  data  illustrate  the  need 
for  performing  the  optimization  procedure  described  earlier.  The  curves 
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corresponding  to  the  normal  distribution  ue  relatively  flat  in  die  ndgjibor- 
hood  of  the  optimum  values  of  r,  indicated  by  the  arrows-  This  means  that 
the  detection  probability  is  affected  only  slightly  if  r  is  chosen  anywhere  m 
the  neighborhood  of  the  optimum  value.  However,  the  cunres  for  example  C 
show  that  if  r  is  chosen  only  slightly  larger  than  the  optimum,  the  probability 
of  detection  will  be  decreased  substantially.  A  similar  situation  (not  shown) 
is  also  true  for  example  B,  the  Rice  distribution. 


EXAMPLE  C  (RAYLEIGH) 


Figure  6.  Probability  of  detection  Pjfr)  as  a  function  of  quantiza¬ 
tion  level  r.  k=25,  N=25,  a=l  0"5. 

For  these  curves  it  is  also  seen  that  the  optimum  value  of  r  is  the  same, 
in  each  case,  for  SNR’s  differing  by  3  dB.  In  fact,  for  example  C  the  value 
r  =  2  is  optimum  for  -4-dB  to  6-dB  SNR,  and  for  example  D  the  value  r  =  5  is 
optimum  for  -5-dB  to  4-dB  SNR.  Thus,  the  optimum  value  of  r  is  essentially 
independent  of  SNR. 

Figure  7  compares  the  RQ  procedure  with  the  optimum  test  based  on 
the  assumption  that  the  distributions  are  known.  Also,  the  effect  of  increasing 
the  size  k-1  of  the  KNS  is  shown.  The  data  for  the  curve  labeled  ‘optimum’  in 
figure  7  were  obtained  from  Robertson  (ref.  13,  p.  765] ;  also,  these  data  are 
for  N  =  32.  (Robertson  [ref.  13]  presented  data  only  for  N  =  2^.)  From 
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SNR,  dB 


Figure  8.  Probability  of  detection  as  a  function  of  SNR-rank-quantization 
(equation  2a)  and  rank-sum  (2b)  procedures  N=25,  k=  100. 


these  figures  we  see  that  the  RQ  procedure  requires  only  approximately  2-3 
dB  more  SNR  than  the  optimum  procedures  while  providing  the  guarantee 
that  the  false  alarm  rate  is  constant. 

The  curves  labeled  ‘binary  integration’  in  figure  7  are  for  the  optimum 
test  that  utilizes  binary  quantization  of  the  data  and  assumes  the  distributions 
of  the  data  are  known  lref.;  9J .  Each  of  these  curves  is  an  upper  bound  to  the 
corresponding  RQ  curves.  The  RQ  curves  approach  this  upper  bound  rapidly 
as  k  increases;  thus,  the  RQ  procedure  performs  nearly  as  well  as  the  optimum 
test  based  on  binary  quantization. 

Figure  8  compares  the  RQ  and  RS  procedures  with  the  optimum  test. 
The  ‘optimum’  curve  for  figure  8 A  was  obtained  from  Robertson  [ref.;  13] . 

For  the  data  in  figure  8A  we  see  that  the  RS  procedure  requires  only  about 
1/2  dB  less  SNR  than  the  RQ  procedure.  Also,  the  RS  procedure  requires 
only  about  1  dB  more  SNR  than  the  optimum.  (The  optimum  curve  shown 
is  for  N  =  32;  the  curve  for  N  =  25,  with  which  comparisons  should  be  made, 
falls  to  the  right  of  it.) 

For  the  data  in  figure  8B  we  observe  the  apparent  anomaly  that  the 
RQ  procedure  performs  better  (requires  less  SNR  for  a  given  p^)  than  the  RS 
procedure.  This  anomaly  is  discussed  later,  and  the  reason  for  the  better 
performance  of  the  RQ  procedure  is  given. 

The  data  of  figure  9  show  the  effect  on  the  probability  of  detection  of 
increasing  the  size  of  the  KNS.  Detection  probability  increases  rapidly  as  k 
increases  from  2  to  about  20  and  then  levels  out.  Also  shown  are  the  proba¬ 
bilities  of  detection  for  optimum  binary  integration  [ref.  9] ;  these  probabilities 
are  upper  bounds  to  the  p^-versus-k  curves. 


Figure  9.  Probability  of  detection  as  a 
function  of  size  of  KNS  -  rank- 


quantization  procedure  (equation  2a). 
Example  B  (Rice);  N=50,  <*=  1(H*. 
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The  apparently  anomalous  result  in  figure  8B,  in  which  the  RQ  proce¬ 
dure  is  shown  to  perform  better  than  the  RS  procedure,  can  be  explained  as 
follows.  The  optimum  test  based  on  ranks  calculates  the  statistic 


v  = 

.  =  >  log - 

a  w-m 


(19) 


where  rj  is  the  value  of  the  random  variable  Rj.  The  statistic  L  is  the  sum  of 
N  independent  and  identically  distributed  (IID)  random  variables 
PlfRi  =  r|) 

Z  =  jog - ;  each  of  which  can  take  one  of  the  k  values 

P0[Ri  =  ri] 

Z  =  log  kPj  [R  =  jl  ,  j  =  1,  2,  . . k.  If  we  normalize  the  random  variable  Z 
so  that  its  minimum  possible  value  is  0  and  its  maximum  possible  value  is  1 , 
we  obtain  a  new  random  variable  Z'  with  values  between  0  and  1 .  If  it  is 
assumed  that  F(x)  and  G(x)  are  related  as  in  example  C  (with  some  fixed  value 
of  u),  the  possible  values  for  Z'  are  as  shown  in  figure  10  (labeled  ‘optimum’). 


Figure  10.  Value  of  normalized  test  statistic  as  a  function  of  rank 
value  R.- 


By  a  similar  normalization,  the  RS  statistic  Wj  in  equation  (3)  can  be  trans¬ 
formed  into  the  sum  of  N I1D  random  variables  having  the  values  between  0 
and  1  indicated  by  die  diagonal  line  in  figure  10.  The  value  of  the  random 
variable  B2(Rj)  that  is  summed  in  equation  (4)  (RQ)  is  shown  in  figure  10  as 
the  solid-line  step  function.  Notice  that  the  RQ  step  function  more  nearly 
approximates  the  optimum  curve  than  the  RS  diagonal  line;  thus,  the  RQ 
procedure  performs  better  than  the  RS  procedure  for  this  case.  Alternatively, 
similar  curves  are  shown  in  figure  10  for  the  RQ  and  optimum  rank  test  for  the 
normal  distribution  (example  D).  We  notice  that  the  diagonal  line  for  the  RS 
procedure  is  a  good  approximation  to  the  optimum.  This  is  corroborated  by 
the  data  of  figure  8C,  which  show  that  the  RQ  procedure  is  almost  as  good  as 
the  optimum  test  (not  based  on  ranks). 


CONCLUSIONS 

Two  distribution-free  procedures  have  been  analyzed,  and  each  can  be 
applied  in  signal  detection  problems  with  little  loss  in  signal  detectability 
compared  with  optimum  procedures  that  require  knowledge  of  the  probability 
distribution  of  the  observed  data.  The  procedures  are  easily  implemented  and 
analyzed  and  should  be  applicable  in  many  existing  and  future  Navy  commu¬ 
nications  systems. 


RECO  EMENDATIONS 

1 ,  Investigate  plans  for  future  communication  systems  for  possible 
application  of  distribution-free  procedures  in  signal  detection. 

2.  Investigate  the  possibility  of  implementing  distribution-free  proce¬ 
dures  through  LSi  or  micro-circuit  techniques. 
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